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Abstract 

This paper refutes the vahdity of the polynomial-time algorithm 
for solving satisfiability proposed by Sergey Gubin. Gubin introduces 
the algorithm using 3-SAT and eventually expands it to accept a broad 
range of forms of the Boolean satisfiability problem. Because 3-SAT 
is NP-complete, the algorithm would have implied P = NP, had it 
been correct. Additionally, this paper refutes the correctness of his 
polynomial-time reduction of SAT to 2-SAT. 
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1 Introduction 



The Boolean satisfiability problem (SAT) deals with determining the satis- 
fiability of a Boolean expression. That is, given an expression of Boolean 
literals, "and" operators, "or" operators, and "not" operators, is there an 
assignment of the literals such that the formula is true? Within complex- 
ity theory, it is a well-known problem that has been heavily researched. It 
was one of the first problems to be shown NP-complete |Coo71j . Thus, a 
polynomial-time algorithm to determine the satisfiability of a Boolean ex- 
pression would imply P = NP. 

Many forms of the Boolean satisfiability problem exist. Despite artificial 
restrictions, most of these problems remain NP-complete. 3-SAT is one of 
these special variations. Restrictions are placed on the form of the Boolean 
expression, but the goal of determining satisfiability remains. 3-SAT expres- 
sions are written in conjunctive normal form as such: 

/ = ci A C2 A . . . A (1) 

Where Ck,k = 1, 2, . . . , m is a disjunctive clause of three or fewer Boolean 
literals. The following is a typical 3-SAT instance: 

/ = V J9 V r) A (^g V p V ^r) A V V r) A (^g V V ^r) (2) 

2 Gubin's Algorithm 

Sergey Gubin recently proposed a polynomial-time algorithm for solving 
3-SAT |Gub07aj . Following a brief mention of the reducibility of the NP 
complexity class to this NP-complete problem, a relatively straightforward 
matrix-trimming algorithm is presented to solve any general 3-SAT problem 
with 0{n^) time complexity in the number of disjunctive clauses. Further- 
more, Gubin reveals that his algorithm can be trivially adapted for general 
Boolean satisfiability problems in conjunctive normal form. 

Gubin later extends this work to show that his algorithm not only deter- 
mines the satisfiability of a Boolean formula in conjunctive normal form, but 
that the particular set of variables satisfying the instance can be recovered 
from the algorithm |Gub07cj . 



2 



In this section, we provide an overview of Gubin's algorithm along with 
an example of its use. Initially, we are given a SAT instance written as in 
Equation [H 

The fundamental mechanism of the algorithm involves doing pair-wise 
comparisons of clauses in order to iteratively "trim" a set of potential solu- 
tions. The problem is first rewritten in a triangular form: 

(ci A C2) A (ci A C3) A (ci A C4) A . . . A (ci A c^) 
(C2 A C3) A (C2 A C4) A . . . A (C2 A c^) 
(C3 A C4) A . . . A (C3 A Cm) 



Each pair of clauses in parentheses is then converted into a matrix. To 
build such matrices, Gubin introduces the notion of "compatibility," which 
he ultimately equates to pair-wise satisfiability; the matrix Cij denotes the 
"compatibility matrix" between clauses Cj and Cj, whose boolean entries rep- 
resent variable assignments satisfying (cj A Cj). For each matrix Qj, each 
entry ea,b corresponds to the variable assignments in row a of the truth table 
for Ci combined with the variable assignments in row b of the truth table for 
Cj. The matrix entry Ca^b is true if the following conditions are met: 

1. Row a of Cj's truth table is true, and row b of Cj's truth table is true. 

2. Any variables present in both q and cj must not have conflicting truth 
table assignments in rows a, b, i.e., if variable x is false in row a of the 
truth table for Cj, x must also be false in row b of the truth table for Cj. 

After the initial triangular array of matrices is constructed, the algorithm 
iteratively "depletes" each of the matrices in rows i through m with the clause 
Cj_i. "Depletion" is achieved by eliminating nonzero entries Cab in the matrix 
Cij whenever column a of matrix Ci_i_j and column b of matrix Cj_i j do not 
have any nonzero entries in the same row. 

To clarify the notions of "compatibility" and "depletion" , we first present 
examples of matrix construction and matrix depletion adapted from examples 
offered by Gubin |Gub07b] . Consider the clauses: 




C2 = V V -ir 



(3) 
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Cs^pV -^qV s (4) 
The corresponding truth tables for these disjunctive clauses are: 



(a) C2 (b) C3 
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Table 1: Truth Tables for Example 



Since each row denotes a unique combination of variable assignments, an 
8x8 matrix can enumerate all possible combinations of variable assignments 
between two clauses of 3 variables each. The matrix €2,3 is constructed by 
representing the truth table of clause C2 on the vertical axis, and the truth 
table of clause C3 on the horizontal axis. A matrix entry is determined to be 
mutually satisfiable if the truth tables for both clauses are true in the rows 
indexed by the matrix, and if there arc no conflicting variable assignments for 
those variables that are shared (if no variables are shared, then this second 
condition is automatically satisfied) . Applying this to the two clauses shown 
above results in the following matrix: 
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In the above matrix, element (2,2) (row 2, column 2) is true because the 
second row of the truth table for C2 is "compatible" with the second row of 
C3; both truth table entries are nonzero, and the shared variables p and q can 
be mutually satisfied by assigning both to false. 

Similarly, (3,3) is false simply because C3 has a zero in row 3 of its truth 
table. The element (5,4) is not compatible because row 5 of C2 requires p — I, 
g = 0, r = 0, while row 4 of C3 requires p = 0, q — 1, s — 1; both p and q 
cause incompatibilities with this assignment. 

Now, suppose we want to "deplete" the above matrix with the clause 
ci — pV qV r. This clause has the truth table: 
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With Gubin's algorithm, we would have computed the compatibility ma- 
trices for Ci^2 and Ci^s in the initial step, obtaining the following two matrices: 
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After depletion, the matrix C2 3 becomes: 
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Comparing to the original matrix, we see that the entries (1,1) and (1,2) 
have been ehminated. As a specific example, elimination of (1,2) was achieved 
by comparing column 1 of Ci^2 to column 2 of Ci^a, and finding that no 
nonzero entries existed in the same row of both columns. 

Note here that no additional rows or dimensions are added to the matrix, 
so we lose the information regarding which particular rows of Ci,2 and Ci^s 
are compatible with the resulting depleted matrix. 

In the algorithm, all matrices in rows i through m are iteratively "de- 
pleted" by the clause corresponding to row i — 1 for i ranging from 2 to m. 
Eventually, once all clauses have contributed to depleting bottom-right ma- 
trices, Gubin asserts that the bottom-right matrix corresponds to the entire 
string of disjunctive clauses. At this point, if there are no matrices with all 
zeros, Gubin's algorithm determines that problem is satisfiable. Similarly, if 
at any time there is a matrix containing only zeros, the algorithm determines 
the problem to be unsatisfiable |Gub07a] . 

The following is a brief walkthrough of an unsatisfiable 2-SAT example: 

/ = Ci A C2 A C3 A C4 = (-ip V g) A (p V -ig) A (p V g) A {^p V ^q) (5) 
Construct the truth tables: 

i^pyq) (pV^g) (pVg) (^P V ^g) 
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Applying compatibility rules to the truth tables, the following initial tri- 
angular array of matrices is created: 



C2 C3 C4 

1 11 

1 1 
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The second and third rows of matrices are depleted by Ci. 
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Finally, the third row is depleted with C2: 
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Since the bottom-right matrix is now all zeros, the algorithm has suc- 
cessfully shown that this 2-SAT problem is unsatisfiable. The same method 
is used for 3-SAT or general SAT problems, where each matrix will simply 
have varying dimensions depending on the size of each disjunctive clause. 
For more examples, refer to Gubin |Gub07b] . 



3 Comments and Analysis 

Gubin's analysis of asymptotic complexity is correct, in that the algorithm re- 
quires O(m^) matrix compatibility comparisons/depletions. Considering the 
relatively sparse matrices, an efficient implementation of the algorithm could 
even be developed. Unfortunately, the algorithm has significant shortfalls in 
other areas. 

An immediate sense of information loss emerges after reading the algo- 
rithm. Before depletion, the algorithm seems to work well, but during deple- 
tion, problems relating to this loss arise. During each iteration of depletion, 
Gubin's algorithm could be comparing two compatibility matrices that share 
just three literals. So, the sixty four bit compatibility matrix could be forced 
to store information on nine distinct literals. Informally speaking, this is not 
enough space to store adequate information on accepting assignments for this 
many literals. There are 2^ possible combinations of these literals, which is 
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far too much data to store in an eight by eight matrix. Modifying the size 
of the depleted matrix (by adding additional dimensions) could fix the algo- 
rithm, but it would also require an exponential amount of time. Regardless, 
this lost information will catch up to the algorithm and cause errors. 

Beyond this, the entire algorithm relies on the correctness of the notion of 
"compatibility" and "depletion" ; the question arises of whether a nonzero en- 
try in a depleted compatibility matrix does, in fact, correspond to a solution 
to an entire string of mutually satisfiable disjunctive clauses. 

Another prominent issue is the fact that, after depleting a matrix with a 
particular clause, no information is recorded as to which precise rows of the 
truth table for that clause are the ones mutually compatible with the entire 
matrix. Given that initial matrices require 64 (8x8) bits to store all configura- 
tions of up to six literals between two 3-SAT clauses, it seems counterintuitive 
that no extra space is needed to store the 2^ interdependent configurations of 
three clauses, 2^^ configurations of four clauses, or ultimately, 2"* configura- 
tions over the entire 3-SAT string. On the contrary, the algorithm computes 
the column-matchup corresponding to a valid configuration in each depletion 
stage, then discards this information at precisely the moment a brute-force 
method would require more information. 

Consider now a 3-SAT case with four disjunctive clauses. Suppose any 
three are satisfiable without the fourth clause, but that overall, the formula 
is not satisfiable. Since the algorithm is designed such that clause C2 is never 
used to deplete matrices built from Ci, clause C3 is never used to deplete 
matrices built from C2, etc, the only matrix that will eventually consider all 
four clauses is the bottom- right matrix, denoted 6*3,4. 

Suppose C3 and C4 have compatible entries, and that depletion by ci does 
not eliminate any entries from this matrix. However, those variable assign- 
ments for Ci which are able to match each of the entries in matrix C3 4 could 
very well be precisely those entries which require C2 to be false. If such a 
case exists, we immediately see that after depletion by ci, we discard the 
information regarding the requirements imposed by ci, and so the algorithm 
goes on to incorrectly predict that the problem is indeed satisfiable. 

Does such a 3-SAT instance exist? Yes, for an almost trivial case: 

/ = (a V 6 V c) A i^a) A (^6) A (^c) (6) 

Obviously, this is unsatisfiable. One of the three variables must be true, 
but a clause forbidding each one to hold a true value is present. This satisfies 
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the above assertion that any triplet of the clauses is satisfiable. 
Let's walk through Gubin's algorithm by hand. Truth tables: 
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Compatibility matrices: 
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During the first depletion step, the matrices remain unchanged; the last 
three clauses have no common variables, and the Ci 2 matrix causes no change 
in any matrix in row two or below. During the second and the final depletion 
steps, the same is true. So, no pattern of unsatisfiability arises. 

Interestingly, changing the order in which the clauses are specified causes 
this problem to not occur. If ci were specified last, then depleting on C2 would 
eliminate two rows, and depleting on C3 would eliminate the remaining one. 
Ci^4 would have no entries left and the algorithm's result would be correct. 
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However, just trying all possible orderings would require exponential time. 
The fact that the bottom-right matrix does not contain complete information 
about all clauses (it cannot, because conjunction is transitive and we have 
just shown that compatibility via Gubin's method is not) directly contradicts 
Gubin's assertion that it is logically equivalent to the conjunction of all the 
formula's clauses. 

4 A Previous Counterexample 

Hegerle presents a purported counterexample to Gubin's algorithm |Heg07 . 
Let B be the set of boolean variables in the original 3-SAT instance. Let C 
be the set of clauses in that instance. Hegerle asserts that Gubin's algorithm 
reduces to three patterns: 

1. 3a E B such that {a, -la} C C 

2. 3a,P e B such that a 7^ /3 and {a V /3, a V ^/?, V /?, V -■/?} C C 

3. 3a, P,-f e B such that a 7^ /3 7^ 7 and {a V V 7, a V /5 V ^7, a V V 
7,aV^/5V^7,^aV/9V7,^aV/5V^7,^aV^/3V7,^aV^/3V^7} C C 

Hegerle claims that Gubin's algorithm will deem a formula unsatisfiable 
if and only if at least one of the above is true. 

This claim is false. While it is correct that Gubin's algorithm only simul- 
taneously considers three clauses (two in a compatibility matrix and one on 
which that matrix is being depleted), some information about other clauses 
is passed through the depletion process. As a result, Gubin's algorithm can 
handle some formulas which do not contain all possible clauses of a given 
length k, k < 3. 

Indeed, Gubin himself demonstrates how his algorithm functions in this 
case |Gub07bj . 

5 Addressing a Reduction of SAT to 2-SAT 

Gubin also presents a polynomial-time reduction of SAT to 2-SAT |Gub07d] . 
In this reduction, he first converts the 3-SAT instance to the form g A h = 
true, where g is a. conjunctive combination of clauses and /i is a conjunctive 
subset of the clauses therein. So, g A h = true may be viewed as a SAT 
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instance. For brevity, let us ignore the method he employs to perform this 
conversion, as it is not where the flaw presented herein lies. 

After producing g and Gubin applies the matrix-compatibility method 
described in his previous paper |Gub07a] to this new equation. He asserts 
that the bottom-right compatibility matrix after the depletion steps is all 
zeros if and only if the original formula is unsatisfiable. However, we have 
shown this to not always be the case. 

When the bottom-right matrix contains nonzero entries, Gubin then enu- 
merates it such that each element is labeled yi. Finally, he creates a set of 
new clauses t],, each containing solely the literal hi ii yi 1, or if yi is 0. 
This is a 1-SAT instance. 

Now, back to our objection. We have already shown that the final matrix 
after depletion may contain nonzero entries even when the 3-SAT instance 
cannot be satisfied. If this is the case, Gubin will generate a 1-SAT instance 
where each variable is only contained in a single clause, and only once within 
that clause. Such a problem is always satisfiable: just assign a variable to be 
false if its one occurrence is negated, and true otherwise. The original 3-SAT 
problem was not satisfiable. Therefore, the reduction is not correct. 

It is possible that the way Gubin converts the 3-SAT instance to g A h 
eliminates all cases where his matrix-compatibility algorithm fails. However, 
no proof of this is presented within his paper |Gub07d] . The reduction, as it 
stands, is not correct. 

6 Conclusion 

While Gubin's algorithm does take polynomial time, it does not accurately 
determine satisfiability. Its underlying principles are reasonable, but the re- 
laxation of the problem at various stages dooms its validity. Keeping track 
of all possible Boolean assignments that lead to satisfiability at each stage 
would fix the algorithm. Unfortunately, this would require an exponential 
amount of time when using a method similiar to the one presented in the orig- 
inal algorithm. Gubin's algorithm fails to prove that P = NP. This remains 
one of the larger unanswered questions in theoretical computer science. 
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